Information-Theoretic Representation Learning for Positive-Unlabeled Classification

Abstract

Recent advances in weakly supervised classification allow us to train a classifier only from positive and unlabeled (PU) data. However, existing PU classification methods typically require an accurate estimate of the class-prior probability, which is a critical bottleneck, particularly for high-dimensional data. This problem has commonly been addressed by applying principal component analysis in advance, but such unsupervised dimension reduction can collapse the underlying class structure. In this paper, we propose a novel representation learning method from PU data based on the information-maximization principle. Our method does not require class-prior estimation and thus can be used as a preprocessing method for PU classification. Through experiments, we demonstrate that our method, combined with deep neural networks, substantially improves the accuracy of PU class-prior estimation, leading to state-of-the-art PU classification performance.

Similar resources

Positive Unlabeled Learning for Data Stream Classification

Learning from positive and unlabeled examples (PU learning) has been investigated in recent years as an alternative learning model for dealing with situations where negative training examples are not available. It has many real world applications, but it has yet to be applied in the data stream environment where it is highly possible that only a small set of positive data and no negative data i...

Ensemble Based Positive Unlabeled Learning for Time Series Classification

Many real-world applications in time series classification fall into the class of positive and unlabeled (PU) learning. Furthermore, in many of these applications, not only are the negative examples absent, the positive examples available for learning can also be rather limited. As such, several PU learning algorithms for time series classification have recently been developed to learn from a s...

Positive Unlabeled Leaning for Time Series Classification

In many real-world applications of the time series classification problem, not only could the negative training instances be missing, the number of positive instances available for learning may also be rather limited. This has motivated the development of new classification algorithms that can learn from a small set P of labeled seed positive instances augmented with a set U of unlabeled instan...

Prototype Based Classification Using Information Theoretic Learning

In this article we extend the (recently published) unsupervised information theoretic vector quantization approach based on the Cauchy–Schwarz-divergence for matching data and prototype densities to supervised learning and classification. In particular, first we generalize the unsupervised method to more general metrics instead of the Euclidean, as it was used in the original algorithm. Thereaf...

Positive-Unlabeled Learning for Pupylation Sites Prediction

Pupylation plays a key role in regulating various protein functions as a crucial posttranslational modification of prokaryotes. In order to understand the molecular mechanism of pupylation, it is important to identify pupylation substrates and sites accurately. Several computational methods have been developed to identify pupylation sites because the traditional experimental methods are time-co...

Journal

Journal title: Neural Computation

Year: 2021

ISSN: 0899-7667, 1530-888X

DOI: https://doi.org/10.1162/neco_a_01337